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Abstract 

We addressed the problem of detecting the change 
in behavior of information diffusion from a small 
amount of observation data, where the behavior 
changes were assumed to be effectively reflected in 
changes in the diffusion parameter value. The prob- 
lem is to detect where in time and how long this 
change persisted and how big this change is. We 
solved this problem by searching the change pattern 
that maximizes the likelihood of generating the ob- 
served diffusion sequences. The naive learning al- 
gorithm has to iteratively update the patten bound- 
aries, each requiring optimization of diffusion pa- 
rameters by the EM algorithm, and is very ineffi- 
cient. We devised a very efficient search algorithm 
using the derivative of likelihood which avoids pa- 
rameter value optimization during the search. The 
results tested using three real world network struc- 
tures confirmed that the algorithm can efficiently 
identify the correct change pattern. We further 
compared our algorithm with the naive method 
that finds the best combination of change bound- 
aries by an exhaustive search through a set of ran- 
domly selected boundary candidates, and showed 
that the proposed algorithm far outperforms the na- 
tive method both in terms of accuracy and compu- 
tation time. 

1 Introduction 

Social networking is now an important part 
of our daily lives, and our behavioral patterns 
are substantially affected by the communica- 
tion through these networks ||Newman et al., 2002[ 



Newman, 2003] |Gruhl etal, 20041 |Domingos, 2005 



Leskovec et al., 20061. It has been shown that a social 



network has many interesting properties, e.g. power law for 
node degree distribution, large clustering coefficient, positive 
degree correlation, etc. | Wasserman and Faust, 1994] , which 
affect how the information actually diffuses through the 
network, and researchers have devised several important 
measures to characterize these features based on the topol- 
ogy/sti'uct ure of t he network | |Wasserman and Faust, 1994| 
[Bonacichi, 1987t |Katz, 19531 . These measures, called 



centrality measures, are expected to be used to iden- 
tify important nodes in the network. However, recent 
studies have shown that it is important to consider 
the diffusion mechanism explicitly and the measures 
based on network structure alone are not enough 
to identify the important nodes | |Kimura et al, 20091 
IKimura et al., 201 0a^. 

Information diffusion is modeled typically by 
a probabilistic model. Most representative and 

fundamental ones are independent cascade (IC) 
model iGoldenbergef a/., 2001 Kempe et al., 2003), 
linear threshold (LT) model l lWatts, 2002i 
IWatts and Dodds, 2007| and their extensions that include 
incorporating asynchronous time delay |Saito et al., 2009| . 
Explicit use of these models to solve such problems as 



the influence maximization problem I Kempe et al., 2003 
IKim ura ef g/ ., 2010a) and t he contamination minimization 
problem [Kimura et al, 2009 j clearly shows the advantage of 
the model. The identified influential nodes and links are con- 
siderably different from the ones identified by the centrality 
measures. However, use of these models brings in yet another 
difficulty. They have parameters that need be specified in 
advance, e.g. diffusion probabilities for the IC model, and 
weights for the LT model, and their true values are not 
known in practice. A series of studies by ySaito et al, 20091 
ISaito et al, 2 010] have shown one way of solving this 
problem in which they used a limited amount of observed 
information diffusion data and trained/learned the model 
such that the likelihood of generating the observed data by 
the model is maximized. 

This paper is in the same line of these studies, but addresses 
a different aspect of information diffusion. Almost all of the 
work so far assumed that the model is stationary. We note 
that our behavior is affected not only by the behaviour of our 
neighbors but also by other external factors. The model only 
accounts for the interaction with neighbors. The problem we 
address here is to detect the change of the model from a lim- 
ited amount of observed information diffusion data. If this is 
possible, we can infer that something unusual happened dur- 
ing a particular period of time by simply analyzing the limited 
amount of data. 

This is in some sense the same, in the spirit, with the work 
by | |Kleinberg, 2002) and |Swan and Allan, 20001. They 
noted a huge volume of the data stream, tried to organize it 



and extract structures behind it. This is done in a retrospec- 
tive framework, i.e. assuming that there is a flood of abundant 
data already and there is a strong need to understand it. Our 
aim is not exactly the same as theirs. We are interested in de- 
tecting changes which is hidden in the data. We also follow 
the same retrospective approach, i.e. we are not predicting 
the future, but we are trying to understand the phenomena 
that happened in the past. There are many factors that bring 
in changes and the model cannot accommodate all of them. 
We formalize this as the unknown changes in the diffusion pa- 
rameter value, and we reduce the problem to that of detecting 
where in time and how long this change persisted and how 
big this change is. We call the period where the parameter 
takes anomalous values as "hot span" and the rest as "nor- 
mal span". To make the analysis simple, we limit the diffu- 
sion model to the asynchronous independent cascade model 
(AsIC) I S aito et ah, 2009| and the form of change to a rect- 
linear one, that is, the diffusion parameter changes to a new 
large value, persists for a certain period of time and is re- 
stored to the original value and stays the same thereafter Q 
In this simplified setting, detecting the hot span is equivalent 
to identifying the time window where the parameter value is 
high and estimating the parameter values both in hot and nor- 
mal spans. 

To this end, we use the same parameter optimization algo- 
rithm as in [Saito et ah, 2 0091, i.e. the EM algorithm that iter- 
atively updates the values to maximize the model's likelihood 
of generating the observed data sequences. The problem here 
is more difficult because it has another loop to search for the 
hot span on top of the above loop. The naive learning algo- 
rithm has to iteratively update the patten boundaries requir- 
ing the parameter value optimization for each combination, 
which is a very inefficient procedure. Our main contribution 
is that we devised a very efficient general search algorithm 
which avoids the inner loop optimization by using the infor- 
mation of the first order derivative of the likelihood with re- 
spect to the diffusion parameters. We tested its performance 
using the structures of three real world networks (blog, Coau- 
thorship and Wikipedia), and confirmed that the algorithm 
can efficiently identify the hot span correctly as well as the 
diffusion parameter values. We further compared our algo- 
rithm with the naive method that finds the best combination 
of change boundaries by an exhaustive search from a set of 
randomly selected boundary candidates, and showed that the 
proposed algorithm far outperforms the native method both 
in terms of accuracy and computation time. 

2 Information Diffusion Model 

The AsIC model we use in this paper incorporates asyn- 
chronous time delay into the independent cascade (IC) model 
which does not account for time-delay, reflecting that each 
node changes its state asynchronously in reality. We recall 
the definition of the AsIC model below, in which we con- 
sider choosing a delay-time from the exponential distribution 



'We discuss that the basic algorithm can be extended to more 
general change patterns in Section (6] and shows that it works for 
two distinct rect-linear patterns. 



for the sake of convenience, but of course other distributions 
such as power-law and Weibull can be employed. 

Let G - (V,E) be a directed graph, where V and E (c 
y X y) are the sets of all the nodes and the links. For any 
V € V, the set of all the nodes that have links from v is denoted 
by F{v) - {u eV; iy, u) e E] and the set of all the nodes that 
have links to v by B(v) - [u eV; (u, v) e E]. Each node has 
one of the two states (active and inactive), and the nodes are 
called active if they have been influenced. It is assumed that 
nodes can switch their states only from inactive to active. 

The AsIC model has two types of parameters ,, and r„,i, 
with < pu,v < 1 and r,,,,, > 0, where and r,,.,. are referred 
to as the diffusion probability through link (u, v) and the time- 
delay parameter through link (u, v), respectively. The infor- 
mation diffusion process unfolds in continuous-time t, and 
proceeds from a given initial active node in the following way. 
When a node u becomes active at time t, it is given a single 
chance to activate each currently inactive node v € F(u). A 
delay-time 6 is chosen from the exponential distribution with 
parameter r„ ,,. The node u attempts to activate the node v 
if V has not been activated by time t + 6, and succeeds with 
probability /9„.,,. If u succeed, v will become active at time 
t + 6. The information diffusion process terminates if no more 
activations are possible. 

3 Problem Setting 

We address the hot span detection problem. In this problem, 
we assume that some change has happened in the way the in- 
formation diffuses, and we observe the diffusion sequences of 
a certain topic in which the change is embedded, and consider 
detecting where in time and how long this change persisted 
and how big this change is. We place a constraint that 
and r„ ,, do not depend on link (m, v), i.e. /?„.,, - p, ru^v - f 
(V(m, v) e E), which should be acceptable noting that we 
can naturally assume that people behave quite similarly when 
talking about the same topic (see Section|6]l. 

Let [Ti, T2] denote the hot span of the diffusion of a topic, 
and let pi and p2 denote the values of the diffusion probabil- 
ity of the AsIC model for the normal span and the hot span, 
respectively. Note that < p2- A diffusion result of the topic 
is represented as a set of pairs of active nodes and their acti- 
vation times; i.e. {(u, f„), (v, f,,), • ■ ■ }. We consider a diffusion 
result D that is generated by the AsIC model with pi for the 
period [0, Ti), p2 for the period [Ti, T2] and pi for the period 
{T2, 00), where the time-delay parameter does not change and 
takes the same value r for the entire period [0, 00). We refer 
to the set Z) as a diffusion result with a hot span. The prob- 
lem is reduced to detecting [Ti, T2] and estimating pi and p2 
from the observed diffusion results. Extensions of this prob- 
lem setting is discussed later (see Sectiorj6]l. 

Figure [T] shows examples of diffusion samples with a hot 
span based on the AsIC model, where the parameters are set 
at pi = 0.1, p2 = 0.3, r = 1.0, Ti = 10 and T2 = 20. 
The network used is the blog network described later in Sub- 
section |5T| We plotted the ratio of active nodes (the num- 
ber of active nodes at a time step t divided by the number 
of total active nodes over the whole time span) for five in- 
dependent simulations, each from a randomly chosen initial 
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Figure 1: Information diffusion in the blog network with a 
hot span for the AsIC model. 

source node at time f = 0. We can clearly see bursty activities 
around the hot span [Ti - 10, T2 = 20]. However, each curve 
behaves differently, i.e., some has its bursty activities only 
in the first half, some other has them only in the last half, 
and yet some other has two peaks during the hot span. This 
means that it is quite difficult to accurately detect the true hot 
span from only a single diffusion sample. Methods that use 
only the observed bursty activities, including those proposed 
by l iSwan and Allan, 2000] and [Kleinberg, 2002 ] would not 
work. We believe that an explicit use of underlying diffusion 
model is essential to solve this problem. It is crucially impor- 
tant to detect the hot span precisely in order to identify the 
external factors which caused the behavioral changes. 

4 Hot Span Detection IVIethods 

Let {D„; m = 1, • ■ • ,M] be a set of M independent infor- 
mation diffusion results, where D„, = {(u, tm^u), (v, f,„,,,), • ■ • }. 
Each D„, is associated with the observed initial time (p^ - 
minjf,,,,,,; (v, fm,v) e D„i], and the observed final time Q>m > 
max{f,„ ,,; (v, f,„_y) e £),„}. We express our observation data by 
2)m = {{Dm, <I>m); m = 1, ■ ■ • , M}. For any t e [0,„, (&,„], we 
set Cm(f) - {v; (v, ?,„_,,) € D,„, f„,_,, < f). Namely, C„,(t) is the 
set of active nodes before time t in the mth diffusion result. 
For convenience sake, we use C,,, as referring to the set of all 
the active nodes in the mth diffusion result. 

4.1 Parameter Learning Framework 

The following logarithmic likelihood function £,(T>m\ p, r) 
has been derived to estimate the values of p and r 
from Dm for the AsIC model in case there is no hot 
span ISaitoef a/.,2009) , 



Z{DM\p,r) = J]i:((£»„,4>J;/7,r) 



m=l 
M 

m— 1 vgC,,, 



l0g/Zm,„ + ^ logg„: 



M'eF(i')\C,„ 



(1) 



where /!,„,,, is the probability density that a node v e D,„ with 
tm,v > is activated at a time f^.v, and §,„.i',w is the probability 
that a node w is not activated by a node v within [<;*,„, O,,,], 
where there exists a link (v, w) e £ and v 6 Cm- The values of 



p and r can be stably obtained by maximizing Eq. ([T]l using 
the EM algorithm | |Saito ef al, 2009| . 

The following parameter switching applies for a hot span 
S - {T\,T2\ where N„, and 'H„, denote the sets of active 
nodes in the m-th diffusion result during the normal and the 
hot spans, respectively. 



px if V e N,„(S), N„(S) = CJTi) U (C„, \ CJT2)), 
P2 if V € 'H„(S) = CJT2) \ C„(Ti). 



Then, an extended objective function X-ii^M', Pi, P2, f,S) 
can be defined by adequately modifying Eq. ([T]l under this 
switching scheme. Clearly, £,{!Dm', Pi, Pi, f',S)is expected to 
be maximized by setting S to the true span S* - [TpTj] 
if a substantial amount of data !Dm is available. Thus, our 
problem is to find the following S . 



arg max -C(Dm; p\,p2,f,S), 



(2) 



where p\, p2, and f denote the maximum likelihood estima- 
tors for a given S . 

In order to obtain S , we need to prepare a reasonable set 
of candidate spans, denoted by S. One way of doing so is 
to construct S by considering all pairs of observed activation 
time points: S — {S — [fi, fi] : fi < f2, h e 7", f2 e 7"), where 
7~ = {fi, • • ■ , is a set of activation time points in Dm- 

4.2 Naive Method 

Now we describe the naive method, which has two iterative 
loops. In the inner loop we first obtain the maximum likeli- 
hood estimators, pi, p2, and r, for each candidate S by max- 
imizing £,{Dm', Pi, P2,r,S) using the EM algorithm. In the 
outer loop we select the optimal S which gives the largest 
XXDm', Pi, Pi, r, 5) value. However, this can be extremely in- 
efficient when is large. To make it work with a reasonable 
computational cost, we restrict the number of candidate time 
points to a smaller value K by selecting K points from T, 
i.e., we construct >S a: - {S - [fi,?!] : h < f2,fi e Tk,t2 G 
Tk], where = (fi, ■ ■ ■ , Ik}- Note that \Sk\ = K{K - l)/2, 
which is large when K is large. 

4.3 Proposed Method 

The naive method should be able to detect the hot span with a 
reasonable accuracy when K is set large at the expense of the 
computational cost, but the accuracy becomes poorer when K 
is set smaller to reduce the computational load. We propose 
a novel detection method which alleviates this problem and 
can efficiently and stably detect a hot span from Dm- 

We first obtain p, and f, based on the original objective 
function of Eq. ([TJ, and focus on its first-order derivative with 
respect to p for each node at each individual activation time. 
Let pu y be the diffusion parameter from a node m to a node 
V. The following formula holds for the maximum likelihood 
estimators due to the uniform parameter setting of Eq. ^ and 
the locally optimal condition. 



d£(DM;p,r) 
dp 



- E 



dZ(DM;p,r) 
dpu,v 



= 0. 



(3) 



Consider the following partial sum for a given S = [Ti, T2]- 



m=l (u,v)£E,u£'H„,(S) 



dpu,v 



(4) 



Clearly, 0(S) should be sufficiently large if S a S* due to 
our problem setting, which leads to p2 > p > p\- Thus, the 
hot span S * can be estimated by searching for S that maxi- 
mizes 0(S). 



argmax 0(5). 

SeS 



(5) 



The nice thing here is that we can incrementally calculate 
0(S) by Eq. (|6]l, where T - {ti, ■ ■ ■ ,1^} and f,- < tj if ; < j. 



dmD„,,<i>J;p,f) 



m=\ (u,v)eE 
iieC,„(tj+i)\C,„(tj) 



dpu,v 



■(6) 



The computational cost for examining each candidate span 
is much smaller than the naive method described above. 
Thus, we can use all the pairs to construct S. We summa- 
rize our proposed method below. 

1. Maximize X(2Dm; p, r) by using the EM algorithm. 

2. Construct T and S. 

3. Detect S by Eq. (|5]l and output S . 

4. Maximize £.{Dm', Pi, pi, r,S) by using the EM algorithm, 

and output p\, pi, and r. 

Here note that the proposed method requires maximization by 
using the EM algorithm only twice. 

5 Experiments 

We experimentally investigated how accurately the proposed 
method can estimate both the hot span and the diffusion prob- 
abilities in the hot and normal spans, as well as its efficiency, 
by comparing it with the naive method using three real world 
networks. We used three different values for K, i.e., K - 5, 
10, and 20 for the naive method. 

The derivation assumed that there are multiple observed 
data sequences, but in the experiments we chose to learn from 
a single sequence, i.e., M - 1, which is the most difficult 
situation. 

5.1 Datasets 

The three data are all bidirectionally connected networks. 
The first one is a trackback network of Japanese blogs used 
in HKimuraef a/.,2009[ , which has 12, 047 nodes and 79, 920 
directed links (the blog network). The second one is a 
coauthorship network used in | |Palla et al, 2005| , which has 
12,357 nodes and 38,896 directed links (the Coauthorship 
network). The last one is a network of people that was derived 
from the "fist of people" within Japanese Wikipedia, used in 
I IKimuraef aC2 009l, and has 9,481 nodes and 245,044 di- 
rected links (the Wikipedia network). 

For these networks, we generated diffusion samples 
with a hot span using the AslC model. According to 
I Kempe et al, 2003"[, we set the diffusion probability for the 



normal span, pi, to be a value smaller than Ijd, where d is 
the mean out-degree of a network, and set the diffusion prob- 
ability for the hot span, p2, to be three times larger than p\. 
Thus, p\ and p2 are 0.1 and 0.3 for the blog network, 0.2 and 
0.6 for the Coauthorship network, and 0.02 and 0.06 for the 
Wikipedia network, respectively. We fixed the time-delay pa- 
rameter at 1 (r = 1) for all the networks because changing r 
works only for scaling the time axis of the diffusion results. 
We set the hot span to [T\ - 10, T2 - 20] based on the ob- 
servation on the preliminary experiments. In all we gener- 
ated five information diffusion samples using these parameter 
values for each network, randomly selecting an initial active 
node for each diffusion sample. 

5.2 Results 

We compared the proposed method with the naive method 
in terms of 1) the accuracy of the estimated hot span S = 
[7^1, T2], 2) the accuracy of the diffusion probabilities pi (for 
the normal span) and p2 (for the hot span), and 3) the com- 
putation time. Both the proposed and the naive methods were 
tested to each diffusion sample mentioned above, and the re- 
sults were averaged over the five independent trials for each 
network. 

Figure |2] shows the accuracy for S in the absolute error 
&s = - T^il + ITi - T2\. We see that the proposed method 
achieves a good accuracy, much better than the naive method 
for every network. As expected, fij for the naive method 
decreases as K becomes larger But, even in the best case 
(K - 20), its average error is about 3 to 10 times larger than 
that of the proposed method. Figure [3] shows the accuracy of 
pi and/?2 in the relative error £p = \pi - Pi\/ Pi + \P2- Pil/ Pi- 
Here again, the average relative error for the naive method 
decreases as K becomes larger However, even in the best 
case (K - 20), it is about 2 to 3 times larger than that of 
the proposed method. We note that the average errors for 
the Coauthorship network are relatively large. This is be- 
cause the number of active nodes within the normal span was 
relatively small for this network. Figure |4] shows the com- 
putation time. It is clear that the proposed method is much 
faster than the naive method. The significant difference is at- 
tributed to the difference in the number of runs of the EM 
algorithm. The proposed method executes the EM algorithm 
only twice: steps 1 and 4 in the algorithm (see Section |43] |. 
On the other hand, the naive method has to execute the EM 
algorithm once for every single candidate span S € Sk which 
is \Sk\ = K{K - l)/2 times (see Section |4~2] |. Indeed, the 
computation time of the naive method for K = 5 is about 
5 times larger for every network, which is consistent with 
\Sk\ - 10. This relation roughly holds also for the other 
two cases (K - 10 and K - 20). This means that even if 
the naive method could achieve a good accuracy by setting 
K to a sufficiently large value, it would require unacceptable 
computation time for such a large K. 

In summary, we can say that the proposed method can 
detect and estimate the hot span and diffusion probabili- 
ties much more accurately and efficiently compared with the 
naive method. Here we mention that we could obtain much 
better results by using more than one diffusion sequence, say 
M = 5, but we have to omit the details due to space hmita- 
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Figure 2: Comparison in accuracies of the estimated hot span 
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Figure 3: Comparison in accuracies of the estimated diffusion probability 
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tions. 

6 Discussion 

We placed a simplifying constraint that the parameters p„^y 
and r„_,, are link independent, i.e. /7„_,, = p, ru,v - r 
(V(m, v) 6 E), by focusing on single topic diffusion se- 
quences. ISaito et al, 2009[|Saito et al, 2010 ] gave some ev- 
idences for this assumption. They examined 7,356 diffu- 
sion sequences for a real blogroll network containing 52, 525 
bloggers and 115, 552 blogroll links, and experimentally con- 
firmed that p and r that were learned from different diffusion 
sequences belonging to the same topic were quite similar for 
most of the topics. This observation naturally suggests that 
people behave quite similarly for the same topic. 

In this paper, we considered AsIC model, but it is 
straightforward to apply the same technique to AsLT 
model | |Saito et al., 2010| and to their SIS versions in which 
each node is allowed to be activated multiple times. The same 
idea can naturally be applied to opinion formation model, e.g. 
value- weighted voter model I jKimura et al, 2010b| . 

The change pattern considered here is the simplest one. We 
can assume a more intricate problem setting such that both p 
and r change for multiple distinct hot spans and the shape 
of change pattern p is not necessarily rect-linear. One possi- 



ble extension is to approximate the pattern of any shape by 
J pairs of time interval each with its corresponding pj, i.e., 
Zy = [i[tj-\,ti\,Pj); i = \, - J} (to = 0,tj = oo) and use 
a divide-and-conquer type greedy recursive partitioning, still 
employing the derivative of the likelihood function ^ as the 
main measure for search. More specifically, we first initial- 
ize Z[ - {([Q,o°), pi)] where pi is the maximum likelihood 
estimator, and search for the first change time point f i , which 
we expect to be the most distinguished one, by maximizing 
\&i[t, °°)>.Pi)l0 We recursively perform this operation 7 times 
by fixing the previously determined change points. When to 
stop can be determined by a statistical criterion such as AIC 
or MDL. This algorithm requires parameter optimization J 
times. Figure |5]is one of the preliminary results obtained for 
two distinct rect-linear patterns using five sequences (M = 5) 
in case of the blog network. MDL is used as the stopping cri- 
terion. The change pattern of p is almost perfectly detected 
with respect to both pj and tj {J - 5). 

7 Conclusion 

In this paper, we addressed the problem of detecting the 
change in behavior of information diffusion from a limited 
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Figure 5: Information diffusion in the blog network with two 
hot spans for the AsIC model. 

amount of observed diffusion sequences in a retrospective set- 
ting, assuming that the diffusion follows the asynchronous 
independent cascade (AslC) model. We defined the "hot 
span" as the period during which the diffusion probability is 
changed to a relatively high value compared with the other 
periods (called the normal spans). A naive method to detect 
such a hot span would have to iteratively update the candi- 
date hot span boundaries, each requiring parameter optimiza- 
tion such that the likelihood function is maximized. This is 
very inefficient and totally unacceptable. We developed a 
novel and general framework that avoids the inner loop op- 
timization during search by making use of the first deriva- 
tive of the likelihood function. It needs to optimize the pa- 
rameter values only twice by the iterative updating algorithm 
(EM algorithm), which reduces the computation times by 5 
to 100 times, and is very efficient. We compared the pro- 
posed method with the naive method that considers only the 
randomly selected boundary candidates, by applying both the 
methods (the proposed and the naive) to information diffu- 
sion samples generated by simulation from three real world 
large networks, and confirmed that the proposed method far 
outperforms the naive method both in terms of accuracy and 
efficiency. Although we assumed a very simplified problem 
setting in this paper, the proposed method can be easily ex- 
tended to solve more intricate problems. We showed one pos- 
sible direction and the preliminary results obtained for two 
rect-linear shape hot spans was very promising. Our imme- 
diate future work is to evaluate our method using real world 
information diffusion samples with hot spans, as well as to 
deal with spatio-temporal hot span detection problems using 
more appropriate stochastic models under a similar problem 
solving framework. 

Acknowledgments 

This work was partly supported by Asian Office of Aerospace 
Research and Development, Air Force Office of Scientific Re- 
search under Grant No. AOARD-10-4053, and JSPS Grant- 
in-Aid for Scientific Research (C) (No. 23500194). 



References 

[Bonacichi, 1987] P. Bonacichi. Power and centrality: A 
family of measures. Amer. J. Social., 92:1170-1 182, 1987. 

[Domingos, 2005] P. Domingos. Mining social networks for 
viral marketing. IEEE Intell. Syst, 20:80-82, 2005. 

[Goldenberg ef flZ., 2001] J. Goldenberg, B. Libai, and 
E. Muller Talk of the network: A complex systems look 
at the underlying process of word-of-mouth. Market. Lett., 
12:211-223,2001. 

[Gruhlef flZ.,2004] D. Gruhl, R. Guha, D. Liben-Nowell, 
and A. Tomkins. Information diffusion through blogspace. 
SIGKDD ExpL, 6:43-52, 2004. 

[Katz, 1953] L. Katz. A new status index derived from so- 
ciometric analysis. Sociometry, 18:39-43, 1953. 

[Kempe et al, 2003] D. Kempe, J. Kleinberg, and E. Tardos. 
Maximizing the spread of influence through a social net- 
work. In KDD 2003, pages 137-146, 2003. 

[Kimuraef flZ., 2009] M. Kimura, K. Saito, and H. Motoda. 
Blocking links to minimize contamination spread in a so- 
cial network. ACM Trans. Knowl. Discov. Data, 3:9:1- 
9:23, 2009. 

[Kimura et al, 2010a] M. Kimura, K. Saito, R. Nakano, and 
H. Motoda. Extracting influential nodes on a social net- 
work for information diffusion. Data Min. Knowl. Disc, 
20:70-97, 2010. 

[KimuraefflZ., 2010b] M. Kimura, K. Saito, K. Ohara, and 
H. Motoda. Learning to predict opinion share in social 
networks. In AA4/-70, pages 1364-1370,2010. 

[Kleinberg, 2002] J. Kleinberg. Bursty and hierarchical 
structure in streams. In KDD 2002, pages 91-101, 2002. 

[Leskovec et al, 2006] J. Leskovec, L. A. Adamic, and B. A. 
Huberman. The dynamics of viral marketing. In EC'06, 
pages 228-237, 2006. 

[Newman ef a/., 2002] M. E. J. Newman, S. Forrest, and 
J. Balthrop. Email networks and the spread of computer 
viruses. Phys. Rev. E, 66:035101, 2002. 

[Newman, 2003] M. E. J. Newman. The structure and func- 
tion of complex networks. SIAM Rev., 45: 167-256, 2003. 

[Palla et al, 2005] G. Palla, I. Derenyi, I. Farkas, and T. Vic- 
sek. Uncovering the overlapping community structure of 
complex networks in nature and society. Nature, 435:814- 
818, 2005. 

[Saito efflZ., 2009] K. Saito, M. Kimura, K. Ohara, and 
H. Motoda. Learning continuous-time information dif- 
fusion model for social behavioral data analysis. In 
ACML 2009, pages 322-337, 2009. 

[Saito effl/., 2010] K. Saito, M. Kimura, K. Ohara, and 
H. Motoda. Selecting information diffusion models 
over social networks for behavioral analysis. In ECML 
PKDD 2010, pages 180-195, 2010. 

[Swan and Allan, 2000] R. Swan and J. Allan. Automatic 
generation of overview timelines. In SIGIR 2000, pages 
49-56, 2000. 



[Wasserman and Faust, 1994] S. Wasserman and K. Faust. 
Social network analysis. Cambridge Univ. Press, Cam- 
bridge, UK, 1994. 

[Watts and Dodds, 2007] D. J. Watts and P. S. Dodds. Influ- 
ence, networks, and public opinion formation. J. Consum. 
Res., 34:441-^158, 2007. 

[Watts, 2002] D. J. Watts. A simple model of global cascades 
on random networks. PNAS, 99:5766-5771, 2002. 



